
Identification of properties important to protein aggregation using feature selection

Author: A Caflisch
A Esteras-Chopo
A Mudher
AM Fernandez-Escamilla
AP Pawar
C Russell Middaugh
David Tai
F Chiti
F Rosenblatt
F Rousseau
G von Heijne
GG Tartaglia
H Nielsen
H Olzscha
J Tian
J Tyedmers
JH Friedman
Jianwen Fang
KF Dubay
L Breiman
L Tjernberg
M Haenlein
M Kuhn
M Prabhakaran
MJ Thompson
MP Badtke
MW West
N Friedman
O Conchillo-Sole
OV Galzitskaya
P Baldi
R Diaz-Uriarte
R Nelson
S Guyon
S Kawashima
S Ventura
Shan Gao
SO Garbuzynskiy
TM Cover
VN Vapnik
WF Weiss
XL Wang
Y Freund
Yaping Fang
Publication venue: 'Springer Science and Business Media LLC'

Repositório Aberto da Universidade do Porto

MIRRAGGE – Minimum Information Required for Reproducible AGGregation Experiments

Author: Bednarikova Z
Burdukiewicz M
Figueiredo F
Galzitskaya OV
Gazova Z
Gomes CM
Macedo-Ribeiro S
Martins PM
Navarro S
Otzen DE
Pastore A
Pereira PJB
Pinheiro F
Pinto M
Serpell LC
Silva A
Skrabana R
Smirnovas V
Sárkány Z
Ventura S
Ziaunys M
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2020

Reports on phase separation and amyloid formation for multiple proteins and aggregation-prone peptides are recurrently used to explore the molecular mechanisms associated with several human diseases. The information conveyed by these reports can be used directly in translational investigation, e.g., for the design of better drug screening strategies, or be compiled in databases for benchmarking novel aggregation-predicting algorithms. Given that minute protocol variations determine different outcomes of protein aggregation assays, there is a pressing demand for standardized descriptions of the different types of aggregates and the detailed methods used in their production. In an attempt to address this need, we assembled the Minimum Information Required for Reproducible Aggregation Experiments (MIRRAGGE) guidelines, considering first principles and the established literature on protein self-assembly and aggregation. This consensus information aims to cover the major and subtle determinants of experimental reproducibility while avoiding excessive technical details that are of limited practical interest for non-specialized users. The MIRRAGGE table (template available in Supplementary Information) is useful as a guide for the design of new studies and as a checklist during submission of experimental reports for publication. Full disclosure of relevant information also enables other researchers to reproduce results correctly and facilitates systematic data deposition into curated databases.

This work was supported by (i) the European Regional Development Fund (ERDF) through the COMPETE 2020 - Operational Programme for Competitiveness and Internationalisation (POCI), Portugal 2020, and by Portuguese funds through FCT - Fundação para a Ciência e a Tecnologia (FCT/MCTES) in the framework of grants POCI-01-0145-FEDER-031173, POCI-01-0145-FEDER-007274, POCI-01-0145-FEDER-031323 (“Institute for Research and Innovation in Health Sciences”), UID/Multi/04046/2013 (BioISI) and PTDC/NEUNMC/2138/2014 (to CMG). SV was funded by the Spanish Ministry of Economy and Competitiveness (BIO2016-78310-R) and by ICREA (ICREA-Academia 2015). ZG and ZB were funded by the Slovak research agencies VEGA (02/0145/17, 02/0030/18) and APVV (APVV-18-0284). RS was funded by VEGA 02/0163/19. DEO was funded by the Lundbeck Foundation (grant no. R276-2018-671) and the Independent Research Foundation Denmark | Natural Sciences (grant no. 8021-00208B). AP's research was supported by the UK Dementia Research Institute (RE1 3556) and by ARUK (ARUK-PG2019B-020).

Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index

Author: A Andreeva
Abdur R Sikder
Albert Y Zomaya
AR Sikder
FMG Pearl
G Pollastri
HM Berman
J Cheng
J Liu
J Sim
JE Gewehr
L Kong
M Dumontier
M Suyama
N Nagarajan
OV Galzitskaya
RA George
RL Marsden
S Veretnik
SF Altschul
SJ Wheelan
T Joachims
TA Holland
V Vapnik
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without knowledge of the structure – by using sequence information only – is an essential step in many types of protein analyses. In the present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. RESULTS: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domain proteins. CONCLUSION: Improved DomainDiscovery compares favourably with other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing a support vector machine with the benchmark_2 dataset.

Springer - Publisher Connector

Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic

Author: A Caflisch
AE Eiben
I Levner
J Cui
J Han
J Tian
KE Marshall
KK Frousios
KS Hareesha
L Goldschmidt
M López de la Paz
MJ Thompson
NV Subba Reddy
O Conchillo-Solé
OV Galzitskaya
P Han
P Moscato
S Bandyopadhyay
S Kawashima
Smitha Sunil Kumaran Nair
SO Garbuzynskiy
SSK Nair
VS Mathura
WL Huang
Y Peng
Y Saeys
Z Zhang
Z Zhu
ZR Li
Publication venue: BioMed Central
Publication date: 01/01/2011

Springer - Publisher Connector

A microscale protein NMR sample screening pipeline

Author: AK Dunker
Asli Ertekin
Clemens Anklin
CS Goh
D Lee
D Pantazatos
G Spraggon
G. V. T. Swapna
Gaetano T. Montelione
J Liu
J Prilusky
James M. Aramini
JJ Ward
JK Everett
JL Markley
JM Aramini
John K. Everett
K Coeytaux
K Peng
Keith Hamilton
Kenith Conover
KW Plaxco
LE Kay
LM Iakoucheva
M Jansson
M Kakuta
NA Farrow
OV Galzitskaya
P Bertone
P Hentschel
P Rossi
Paolo Rossi
PE Wright
Q Zhang
R Linding
Rong Xiao
S Sharma
SW Englander
TB Acton
Thomas B. Acton
VL Woods Jr
W Peti
Yuanpeng J. Huang
Z Dosztanyi
ZR Yang
Publication venue: Springer Netherlands
Publication date: 01/01/2009

As part of efforts to develop improved methods for NMR protein sample preparation and structure determination, the Northeast Structural Genomics Consortium (NESG) has implemented an NMR screening pipeline for protein target selection, construct optimization, and buffer optimization, incorporating efficient microscale NMR screening of proteins using a micro-cryoprobe. The process is feasible because the newest-generation probe requires only small amounts of protein, typically 30–200 μg in 8–35 μl volume. Extensive automation has been made possible by the combination of database tools, mechanization of key process steps, and the use of a micro-cryoprobe that gives excellent data while requiring little optimization and manual setup. In this perspective, we describe the overall process used by the NESG for screening NMR samples as part of a sample optimization process, assessing optimal construct design and solution conditions, as well as for determining protein rotational correlation times in order to assess protein oligomerization states. Database infrastructure has been developed to allow for flexible implementation of new screening protocols and harvesting of the resulting output. The NESG micro NMR screening pipeline has also been used for detergent screening of membrane proteins. Descriptions of the individual steps in the NESG NMR sample design, production, and screening pipeline are presented in the format of a standard operating procedure.

Springer - Publisher Connector

Improved general regression network for protein domain boundary prediction

Author: A Ceroni
A Vieira
Abdur R Sikder
AK Jain
Albert Y Zomaya
AR Sikder
Bing Bing Zhou
C Chothia
C Civera
CC Lee
CR Robinson
DB Wetlaufer
FMG Pearl
G Pollastri
HC Van Leeuwen
HM Berman
J Chen
J Cheng
J Liu
J Sim
JCB Melo
JE Gewehr
JS Richardson
JSR Jang
M Dumontier
M Suyama
MJ Lehtinen
N Nagarajan
OV Galzitskaya
P Baldi
P Bork
Paul D Yoo
RA George
RE Schapire
RL Marsden
RR Copley
RR Joshi
RS Gokhale
S Prompramote
S Veretnik
SF Altschul
TA Holland
Y Freund
Publication venue: BioMed Central
Publication date: 13/02/2008

Background: Protein domains present some of the most useful information that can be used to understand protein structure and functions. Recent research on protein domain boundary prediction has been mainly based on widely known machine learning techniques, such as Artificial Neural Networks and Support Vector Machines. In this study, we propose a new machine learning model (IGRN) that can achieve accurate and reliable classification, with significantly reduced computations. The IGRN was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. Results: The proposed model achieved an average prediction accuracy of 67% on the Benchmark_2 dataset for domain boundary identification in multi-domain proteins and showed superior predictive performance and generalisation ability among the most widely used neural network models. With the CASP7 benchmark dataset, it also demonstrated comparable performance to existing domain boundary predictors such as DOMpro, DomPred, DomSSEA, DomCut and DomainDiscovery with 70.10% prediction accuracy. Conclusion: The proposed model compares favourably with other existing machine learning based methods as well as widely known domain boundary predictors on two benchmark datasets and excels in the identification of domain boundaries in terms of model bias, generalisation and computational requirements. © 2008 Yoo et al; licensee BioMed Central Ltd.

Michigan Technological University

Rules Governing Selective Protein Carbonylation

Author: A Fredriksson
A Shevchenko
A Temple
Adrien Ducret
Andreas Hofmann
B Desnues
BS Berlett
BS Shastry
C Job
CN Oliver
DA Bota
DJ Selkoe
E Cabiscol
E Maisonneuve
E Wolf
Emmanuel Talla
ER Stadtman
Etienne Maisonneuve
F Ferron
G Reverter-Branchat
H Mirzaei
I Callebaut
I Dalle-Donne
J Bourhis
J Liu
J Mostertz
J Prilusky
JM Chandonia
JR Requena
K Coeytaux
M Bucciantini
OV Galzitskaya
P Lieutaud
Pierre Khoueiry
R Linding
RL Levine
RL Tatusov
RS Sohal
S Dukan
S Lee
Sabrina Lignon
Sam Dukan
SF Altschul
SO Garbuzynskiy
Sonia Longhi
T Grune
T Nystrom
Z Dosztanyi
Z Obradovic
ZR Yang
Publication venue: Public Library of Science
Publication date: 01/01/2009

BACKGROUND: Carbonyl derivatives are mainly formed by direct metal-catalysed oxidation (MCO) attacks on the amino-acid side chains of proline, arginine, lysine and threonine residues. For reasons unknown, only some proteins are prone to carbonylation. METHODOLOGY/PRINCIPAL FINDINGS: We used mass spectrometry analysis to identify carbonylated sites in BSA that had undergone in vitro MCO and in 23 carbonylated proteins in Escherichia coli. The presence of a carbonylated site rendered the neighbouring carbonylatable site more prone to carbonylation. Most carbonylated sites were present within hot spots of carbonylation. These observations led us to suggest rules for identifying sites more prone to carbonylation. We used these rules to design an in silico model (available at http://www.lcb.cnrs-mrs.fr/CSPD/), allowing an effective and accurate prediction of sites and of proteins more prone to carbonylation in the E. coli proteome. CONCLUSIONS/SIGNIFICANCE: We observed that proteins evolve to either selectively maintain or lose predicted hot spots of carbonylation depending on their biological function. As our predictive model also allows efficient detection of carbonylated proteins in Bacillus subtilis, we believe that our model may be extended to direct MCO attacks in all organisms.

HAL AMU

Public Library of Science (PLOS)

HAL Descartes

Multiple Routes and Milestones in the Folding of HIV–1 Protease Monomer

Author: A Barducci
A Fersht
A Laio
A Wallqvist
AF Noel
AG Tomasselli
Alessandro Barducci
B Hess
C Chipot
C Clementi
C Dellago
D Xie
E Alm
E Shakhnovich
F Cecconi
F Ding
F Pietrucci
Francesco L. Gervasio
G Bussi
G Settanni
G Tiana
H Kogo
I Bahar
JC Phillips
JL Klepeis
KA Dill
L Bannwarth
L Galiano
L Holm
LK Nicholson
M Bonomi
M Vendruscolo
M Yan
Markus J. Buehler
Massimiliano Bonomi
Michele Parrinello
MJ Bowman
MK Rout
OV Galzitskaya
P Weinkam
PC Whitford
PE Leopold
PG Wolynes
R Ishima
RA Broglia
RW Shafer
S Piana
S Rusconi
SP Lloyd
U Brandes
V Daggett
V Hornak
V Munoz
WL Jorgensen
Y Levy
Y Sugita
Publication venue: Public Library of Science
Publication date: 31/01/2010

Proteins fold on a time scale incompatible with a mechanism of random search in conformational space, indicating that they are somehow guided to the native state through a funneled energy landscape. At the same time, the heterogeneous kinetics suggests the existence of several different folding routes. Here we propose a scenario for the folding mechanism of the monomer of HIV–1 protease in which multiple pathways and milestone events coexist. A variety of computational approaches supports this picture. These include very long all-atom molecular dynamics simulations in explicit solvent, an analysis of the network of clusters found in multiple high-temperature unfolding simulations and a complete characterization of free-energy surfaces carried out using a structure-based potential at atomistic resolution and a combination of metadynamics and parallel tempering. Our results confirm that the monomer in solution is stable toward unfolding and show that at least two unfolding pathways exist. In our scenario, the formation of a hydrophobic core is a milestone in the folding process which must occur along all the routes that lead this protein towards its native state. Furthermore, the ensemble of folding pathways proposed here substantiates a rational drug design strategy based on inhibiting the folding of HIV–1 protease.

Repository for Publications and Research Data

Elsevier - Publisher Connector

Public Library of Science (PLOS)

UCL Discovery

An Evolutionary Trade-Off between Protein Turnover Rate and Protein Aggregation Favors a Higher Aggregation Propensity in Fast Degrading Proteins

Author: A Ben-Zvi
A Caflisch
A Esteras-Chopo
A Stolzing
A Tonoki
A Trovato
AJ McClellan
AM Fernandez-Escamilla
AW Bryan Jr
B Aranda
CJ Proctor
D Kaganovich
DE Otzen
DL Wheeler
E Monsellier
ET Powers
F Al-Shahrour
F Bregegere
F Chiti
F Rousseau
FA Rojas Quijano
Frederic Rousseau
G Carrard
GG Tartaglia
Greet De Baets
HC Yen
HM Berman
I Medina
J Reumers
J Schymkowitz
Javier Delgado Blanco
Joaquin Dopazo
Joke Reumers
Joost Schymkowitz
Jose M. Sanchez-Ruiz
JS Hwang
JW Schymkowitz
L Goldschmidt
L Masino
LM Luheshi
M Bucciantini
M Saiki
MJ Thompson
NS de Groot
O Conchillo-Sole
OV Galzitskaya
PK Teng
R Linding
RS Rajan
S Maurer-Stroh
S Pechmann
S Ventura
S Yoon
S Zibaee
T Barrett
U Schubert
V Prahlad
WE Balch
Y Li
Y Zhu
Publication venue: Public Library of Science
Publication date: 01/06/2011

We previously showed the existence of selective pressure against protein aggregation by the enrichment of aggregation-opposing ‘gatekeeper’ residues at strategic places along the sequence of proteins. Here we analyzed the relationship between protein lifetime and protein aggregation by combining experimentally determined turnover rates, expression data, structural data and chaperone interaction data on a set of more than 500 proteins. We find that selective pressure on protein sequences against aggregation is not homogeneous, but that short-living proteins on average have a higher aggregation propensity and fewer chaperone interactions than long-living proteins. We also find that short-living proteins are more often associated with deposition diseases. These findings suggest that the efficient degradation of high-turnover proteins is sufficient to preclude aggregation, but also that factors that inhibit proteasomal activity, such as physiological ageing, will primarily affect the aggregation of short-living proteins.